3574 results found.
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
7,000 sentencesProduction Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Self-paced ensemble learning for speech and audio classification
-
Paper track:8.6 Neural network training methods (including new/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Radu Tudor Ionescu | CREMA-D | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CC BY 4.0
Size:
1000 hoursProduction Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion
-
Paper track:8.5 Novel neural network architectures (e.g. seque/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Duc Le | LibriSpeech | /N |
Documentation:
Freely available in English
Spoken (ASR) Transcript
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons BY-SA (CC BY-SA)
Size:
None sentencesProduction Status:
Newly created-in progress
Use:
Language Modelling
-
Paper title:Phoneme-BERT: Joint Language Modelling of Phoneme Sequence and ASR Transcript
-
Paper track:11.6 Language modeling for conversational speech (/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ayush Kumar | Phoneme-ASR Corpus for Noisy Speech | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
MIT License
Size:
73 GByteProduction Status:
Existing-used
Use:
Machine Learning
-
Paper title:Y^2-Net FCRN for Acoustic Echo and Noise Suppression
-
Paper track:14.11 Acoustic Echo Cancellation (AEC) Challenge/Poster Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ernst Seidel | Microsoft INTERSPEECH 2021 AEC Challenge Dataset | /N |
Documentation:
Documentation at Github repository (English)
Aligner,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Apache-2.0 License
Size:
None Production Status:
Existing-used
Use:
Corpus Creation/Annotation
-
Paper title:Hi-Fi Multi-Speaker English TTS Dataset
-
Paper track:7.13 Tools and data for speech synthesis/Poster Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Evelina Bakhturina | CTC-Semgnetation | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Public Domain
Size:
24 hoursProduction Status:
Existing-used
Use:
Speech Synthesis
-
Paper title:ADEPT: A Dataset for Evaluating Prosody Transfer
-
Paper track:7.14 Evaluation of speech synthesis/Poster Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Alexandra Torresquintero | LJ Speech | /N |
Documentation:
(see URL)
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
non-commercial use
Size:
20 hoursProduction Status:
Existing-used
Use:
Speech Synthesis
-
Paper title:ADEPT: A Dataset for Evaluating Prosody Transfer
-
Paper track:7.14 Evaluation of speech synthesis/Poster Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Alexandra Torresquintero | Voice Factory audiobook recordings for Blizzard 2013 | /N |
Documentation:
(see URL)
Speech/Written
Evaluation Data,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
195 sentencesProduction Status:
Newly created-finished
Use:
Evaluation/Validation
-
Paper title:ADEPT: A Dataset for Evaluating Prosody Transfer
-
Paper track:7.14 Evaluation of speech synthesis/Poster Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Alexandra Torresquintero | ADEPT | /N |
Documentation:
Documentation will be available in English
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
LDC
Size:
14 GByteProduction Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Advanced Long-context End-to-end Speech Recognition Using Context-expanded Transformers
-
Paper track:8.5 Novel neural network architectures (e.g. seque/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Takaaki Hori | Switchboard Corpus | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
LDC
Size:
None Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Leveraging non-target language resources to improve ASR performance in a target language
-
Paper track:8.6 Neural network training methods (including new/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jayadev Billa | Wall Street Journal (WSJ) Corpus | /N |
Documentation:
None




